Redundancy of the Lempel-Ziv incremental parsing rule

نویسنده

  • Serap A. Savari
چکیده

The Lempel-Ziv codes are universal variable-tofixed length codes that have become virtually standard in practical lossless data compression. For any given source output string from a Markov or unifilar source, we upper-bound the difference between the number of binary digits needed to encode the string and the self-information of the string. We use this result to demonstrate that for unifilar or Markov sources, the redundancy of encoding the first n letters of the source output with the Lempel-Ziv incremental parsing rule (LZ'78), the Welch modification (LZW), or a new variant is O((lnn)-'), and we upper-bound the exact form of convergence. We conclude by considering the relationship between the code length and the empirical entropy associated with a string. Zndex TermsLempel-Ziv codes, Markov sources, unifilar sources, renewal theory.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Simple Technique for Bounding the Pointwise Redundancy of the 1978 Lempel-Ziv Algorithm

Abstract: If x is a string of nite length over a nite alphabet A, let LZ(x) denote the length of the binary codeword assigned to x by the 1978 version of the Lempel-Ziv data compression algorithm, let t(x) be the number of phrases in the Lempel-Ziv parsing of x, and let (x) be the probability assigned to x by a memoryless source model. Using a very simple technique, we prove the pointwise redun...

متن کامل

Universal coding of nonstationary sources

In this correspondence we investigate the performance of the Lempel–Ziv incremental parsing scheme on nonstationary sources. We show that it achieves the best rate achievable by a finite-state block coder for the nonstationary source. We also show a similar result for a lossy coding scheme given by Yang and Kieffer which uses a Lempel–Ziv scheme to perform lossy coding.

متن کامل

Memoryless nonlinear system identification with unknown model order

Combining a note by Rissanen and an idea of enumerative coding we obtain a new implementation of the Ziv-Lempel incremental parsing algorithm for coding and decoding discrete data sequences. Index Terms -Ziv-Lempel algorithm, enumerative coding.

متن کامل

A Randomness Test Based on T-Complexity

We propose a randomness test based on the T-complexity of a sequence, which can be calculated using a parsing algorithm called Tdecomposition. Recently, the Lempel-Ziv (LZ) randomness test based on LZ-complexity using the LZ78 incremental parsing was officially excluded from the NIST test suite in NIST SP 800-22. This is caused from the problem that the distribution of P-values for random seque...

متن کامل

Lempel-Ziv Dimension for Lempel-Ziv Compression

This paper describes the Lempel-Ziv dimension (Hausdorff like dimension inspired in the LZ78 parsing), its fundamental properties and relation with Hausdorff dimension. It is shown that in the case of individual infinite sequences, the Lempel-Ziv dimension matches with the asymptotical Lempel-Ziv compression ratio. This fact is used to describe results on Lempel-Ziv compression in terms of dime...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • IEEE Trans. Information Theory

دوره 43  شماره 

صفحات  -

تاریخ انتشار 1997